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10 SEARCH METHOD AND SYSTEM AND SYSTEMS USING THE SAME 

Field of the TnvPnti'rtn 

The present invention relates generally to data searching methods and 
systems and, more particularly concerns systems utilizing them. 

15 Background of the Inventinn 

The Ihtemet, and particularly the Worldwide Web. has caused a virtual 
infonnation explosion. An average user, making use of a conventional web browser, now 
has available to him a mass of infomiation that would have been unimaginable just a few 
years ago. This includes infonnation available from professional and commercial sources, 
individuals, and message boards or forums, where users "congregate" to discuss every 
imaginable topic, and some that are not. With the wealth of infonnation that is available, a 
new problem has arisen: How can that infonnation be found? 

This problem has been addressed by a plethora of "search engines", which 
are software programs and infonnation systems that are specifically designed to assist 
users in finding infonnation. While existing search engines have been adequate, they are 
limited in their ability to uncover useful infonnation when users are searching. The 
primary reason is that search engines tend to be language based, and a searcher is not 
always familiar with the common temiinology in his field of search. Also, there may be 
useful infonnation available which does not confomi to the common tenninology. It also 
takes substantial skill or experience to fomiulate queries that will produce meaningful 
results. 

In accordance with the present invention, search results are achieved that 
are broader and more intelligent than basic keyword searching. This is achieved by 
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imposing a structure on data being searched and utilizing the same structure for search 
queries. Relevant information is then uncovered by correlating the structure of the data 
being searched and the structure of the query. Items to be searched can include anything: 
messages, discussions, articles, polls, transcripts, or anything else that can be linked to or 
5 pulled from a database. Search results can be included that are less than 100% relevant, 
and not just 100% relevant. In the absence of, or in addition to. results that would he 
generated by a Boolean keyword-only-search, users can retrieve results of some relevance, 
for example as detennined by a set of selectable filter criteria. Consequently, merchants 
can sell inventory which might otherwise be unseen and/or users can find infonnation 
10 which might otherwise stay hidden in an overly strict Boolean search. 

The method of the present invention is the glue that holds online speakers 
together as they seek to use the Worldwide Web to communicate as they do in life. It lets 
users speak without seeing the spam that fills most message boards; aUows interesting 
conversations to take place without interruption; and gives users the anonymity to talk 
15 candidly without fear that their identities may be revealed. 

Where message board sites or forums are concerned, the present invention 
transforms ordinary sites into profitable "para-sites." Para-sites are sites that feed off the 
work of their own users. A para-site powered by the present invention collects interestmg. 
relevant infomiation by harnessing users to post and organize content, at no cost to the 
20 site-operator. Methods and systems embodying the present invention will hereafter be 
referred to by use of the assignee's trademark TRANSPARENSEE™. Users fmd sites 
stickier than other sites because of the high quality of information generated by the present 
invention. Site owners can restrict access to this infomiation in different ways, allowing 
the most valuable information to be repackaged and resold to different markets at different 
25 price points. 

As repositories of filtered information, TRANSPARENSEEtm sites attract 
users with specific interests. Users who speak inteUigently about subjects they know soon 
find that their opinions on that subject carry more weight - and are heard by more people - 
than the opinions of others. The weight given to a particular user's thoughts on a subject 
30 IS quantified as the user's "reputation" for knowing that subject. 

TRANSPARENSEEtm sites allow users to develop and maintain complex, 
multi-variable reputations for a wide variety of different subjects. As users develop high 
reputations for knowing a particular subject, they gain privileges on the site as a result; as 
they gain privileges, their investment in the site grows. High-reputation use« become 
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reluctant to move conversations off-site because, by leaving, they'll lose the benefits 

they've gained as high-reputation users. 

As a result, high-reputation users tend to remain on TRANSPARENSEE^m 

sites, and communities develop. These communities are deeply rooted in the site due to 

the investments their members have made by building reputations. For this reason. 

community members (and communities) cannot easily be lured away to non- 

TRANSPARENSEE™ sites. 

A sticky community of experts sharing information in a highly accessible 
way attracts new users. New users generate content, develop reputations, and become 
community members, thus adding to the attractive pull of the community. 

These network effects feed upon themselves, building small communities 
into large ones. The larger a community grows, the more infomiation it has under 
discussion, the greater the number and expertise of its usera. and the stronger a pull it 
exerts on new members. When a community grows large enough and vibrant enough, it 
becomes the only logical place for a new user to go in order to learn about or discuss a 
subject. 

Because the present invention makes it easier for people to communicate, 
sites that use the present invention quickly attract users. As these users gain reputations 
they develop into communities that are hard to displace. Network effects cause these 
communities to grow quickly. Taken together, this means that the first company to use the 
reputation feature of the present invention in any particular market has a substantial first- 
mover advantage. The bulk of users in that market will end up on TRANSPARENSEE™ 
sites, and will form deep-rooted communities 

TRANSPARENSEETM site reputations are portable. Reputation values are 
stored at and administered from a central location, allowing users to carry their reputations 
with them from TRANSPARENSEE™ site to TRANSPARENSEE™ site. In other 
embodiments, reputation values are stored in a partly or wholly distributed fashion. 

As the number of TRANSPARENSEE™ sites grows, the company's 
proprietary database of reputations also grows. When this database has reached a critical 
mass it will have tremendous value. Companies that choose to power their sites with the 
present invention will automatically become members of the TRANSPARENSEE™ 
Network, allowing them access to a large user base of individuals who may start using 
their pre-built reputations on the new site right away. 
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By allowing Web-site operators to inexpensively gather and distribute 
mstder speech." the present invention fills a demand which, though strong, has not been 
met by any other product. TH. invention is equally unique in the way that it allows 
hcensees to precisely target users based on detailed information without invading their 
individual privacy. 

The present invention provides several immediate benefits. It promotes the 
disclosure of superior information, then ranks and organizes that information in a way that 
allows It to be easily packaged and sold to different audiences at different price points It 
makes sites stickier while at the same tune allowing licensees to provide advertiser, with 
far more narrowly targeted advertisements than they otherwise could, substantially 
.ncreasmg advertising revenues. And it allows companies to lessen (or eliminate) the cost 
of hiring moderators to monitor online discussion. 



Brief De.scripttnn n f the Drawinp g 

The foregoing brief description, as well as further objects, features, and 
advantages of the present invention will be understood more completely from the 
followmg detailed description of a presently preferred, but nonetheless illustrative 
embodiment, with reference being had to the accompanying drawings, in which: 

Fig. 1 illustrates an embodiment of a typical static system wherein boards 
are grouped by firms, industries and topics; 

Fig. 2 illustrates one embodiment of a system nmning utilizing the 
rqjutation aspect of the present invention; 

Fig. 3 illustrates examples of relationships; 
Fig. 4 illustrates an embodiment of a simple dynamic system; 
Fig. 5 illustrates an example of selected categories of content and user 
selected categories being used as inputs to generate relevances; 

Fig. 6 illustrates an embodiment of a complex dynamic system; 
Fig. 7 illustrates an example flow chart for updating a user's rating; 
Fig. 8 shows an example of calculating an aggregate reputation; 
Fig. 9 illustrates an embodiment of threshold filtering wherein a palette 
contains a scatteiplot. And each dot represents s message- 
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Fig. 10 illustrates an embodiment of a scatterplot wherein the user has 
chosen to view messages of high message quality without much regard to the reputation of 
the poster; 

Fig. 11 illustrates an embodiment of a scatterplot wherein the user has 
chosen to view messages posted by users with high reputations without much regard to 
message quality; 

Fig. 12 illustrates an embodiment of a scatterplot wherein the user has 
chosen to view messages of high quality written by people with high reputations; 

Fig. 13 illustrates an embodiment of a scatterplot wherein the average 
combination of reputation and message rating is selected by users of a certain filter set; 

Fig. 14 illustrates an embodiment of related filters; 

Fig. 15 illustrates an example fiow chart of annotation posting; 

Fig. 16 illustrates an embodiment of tagged content.; 

Fig. 1 7 illustrates an embodiment of annotated tagged content; 

Fig. 18 illustrates an example flow chart of posting at different levels of 

anonymity; 

Fig. 19 illustrates key features of different levels of anonymity; 
Fig. 20 illustrates an example of onion routing; 

Fig. 21 illustrates an example of determining a discussion rating based on 
multiple factors; 

Figure 22 is a functional block diagram illustrating the preferred 
environment for the present invention; 

Figure 23 is an exemplary partial screen shot presented to a searcher in the 
dating service database; 

Figure 24 is a screen shot representing the results of an exemplary search; 

Figures 25a and 25b. together illustrate the results of an enhanced search; 

Figures 26a and 26b are screen shots of a page of the online dating service 
which pennits a searcher to review a candidate's long answers and a summary of the 
multiple choice answers; 

Figure 27 is a screen shot of a summary page for a user; 

Figure 28 is a multi-level tree representing a category with a hierarchical 

structure; 

Figure 29 illustrates a scalar category as represented by a tree with a single 

top node; 
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Figure 30 is a tree diagram illustrating a process for determining relevance 
of a category having a hierarchical data structure; and 

Fig. 31is a tree diagram illustrating a process for detennining the relevance 
value of a category having a scalar structure. 



Detailed Descriptmn n f the Preferred Embodiments 

Figure 22 is a functional block diagram illustrating the preferred 
environment for the present invention. A plurality of users* computers U access a content 
server C via a network I, preferably the Internet. Server C provides the users U access to a 
content database CD. Database CD may provide various types of information. For 
example, it may maintain the information used by an online dating service. Alternatively, 
it could provide the information for a restaurant survey service or wine survey service, or 
numerous other special interest services. Database CD could also include, in addition to 
surveys, product reviews and articles of interest on various subjects. 

Also connected to the network I is a web server W which cooperates with a 
system S. in accordance with the present invention, to manage users' access to information 
in database CD. Within system S, a query and search module 20 in accordance with the 
present invention interfaces with users, permitting them to formulate requests for 
information from database CD. M odule 2 0 c reates. m anages and m aintains a structure 
database 10. which contains information describing the structural relationship between 
various pieces of information in database CD. Database 10 also contains information 
relating to the structural relationship between various portions of information in a query in 
a format comparable to the structural relationship of information in database CD. 

In accordance with the present information, information in the database 10 
is used to correlate the data structure of a query to the structure of database CD. in order to 
determine that information in database CD which needs to be provided to a user in 
response to a query. Server W then connects the user to server C. with instructions to 
server C regarding what information is to be provided to the user from database CD. 

In some embodiments of the invention, system S also includes a user 
information module. This module is particularly useful in systems in which users access 
information in database CD which has been provided by other users. Module 30 could 
then, for example, include information about the reputation of various users with respect to 
the information which they have furnished. A user accessing information in database CD 
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Which has been provided by other users is then able to gauge the rdiabihty of that 
information. 

Those skiUed in the art will appreciate that the functions of servers C and 
W could be combined in a single server. Alternatively, server W and system S could 
accommodate access to different, independent content databases CD relating to different 
subject matter. The user could thereby be offered access to infomiation in a plurality of 
databases of different content through a single query generated via web server W. 

The invention will best be understood through the detailed description of a 
number of preferred embodiments. In accordance with a first embodiment, a dating 
service is provided in which persons seeking potential mates (candidates) populate a 
database with infomiation relating to themselves. Potential mates (searchers) can then 
access that database, providing various search criteria, in order to locate appropriate, 
potential mates. Those skilled in the art will appreciate that a similar model is applicable 
for numerous other services, such as, employment agency services. 

Figure 23 is an exemplary screen shot presented to a searcher in the dating 
service database. The searcher is presented with a plurality of multiple choice menus 40 
from which he is to select desirable traits of a potential mate. For example, the top three 
menus on the left of Fig. 23 relate to the gender, height and weight of a potential mate, 
while the top three menus 40 on the right relate to the age, marital status and education of 
the potential mate. A searcher need not make a selection in every menu 40. but only those 
which he considers important. Upon making those selections, the searcher clicks on the 
search button 42, and the search commences. Although not shown specifically on this 
screen, the searcher may be offered an opportunity to assign a relative weight to the 
different menus prior to activating the search. 

Figure 24 is a screen shot representing the results of an exemplary search. 
In this case, the user has made selections in menus 40 relating to gender, age, height, 
martial status, weight, education, eye color, and hair color. That search has produced two 
candidates. Heidildtch and Bobou. both of which are exact matches to the selected criteria. 

In this embodiment, a searcher is also able to dick on the button 44 in order 
30 to obtain an enhanced search. 

Figures 25a and 25b. together illustrate the results of an enhanced search, 
m addition to the two exact matches, there are a number of approximate matches. For 
example. "Landdecker" has a weight in excess of the selected category, but otherwise 
matches. Similarly, starting with "Helena", the weight is below the selected range. 
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Similarly, the remaining entries in Fig. 25a all relates to weight which are in excess of the 
selection and Fig. 25b relates to candidates which are older. The present invention is 
therefore able to locate matches which are close, but are not exact. The candidates are 
listed in decreasing order of relevance as defined by the user's selected criteria. The 
listing of users with different weights above those which are older reflects a relative higher 
menu weighting imposed on the weight sub-category than on the age sub-category. 

In accordance with the present embodiment, a candidate also provides long 
answers to preset questions. Figures 26a and 26b are screen shots of a page of the online 
dating service which permits a searcher to review a candidate's long answers (Fig. 26a) 
and a summary of the multiple choice answers (Fig. 26b). In the column 50 of Fig. 26a. 
the searcher is also offered a list of the candidates most similar to this one. At this point,' 
the searcher may click on any of the other candidates in column 50, and he will be able to 
access the data for that candidate. 

For example, should the user click " LubaO" i n c olumn 5 0, he wouldbe 
transferred to a summary page for that user, illustrated by the screen shot of Fig. 27. 

As explained above, the present invention is not limited to text searching, 
but can find relevant infomiation even when text does not match. This is accomplished by 
establishing the relevance of data based upon correlating a searcher's selected data with 
the data structure of database 10. In order to achieve this, database 10 must contain 
information representing the stmctural relationship of information in database CD, and 
that infoimation must be updated as the content of database CD is changed. 

In creating database 10, it is first necessary to define categories of 
information in database CD. For example, in the database represented by the screen of 
Fig. 23, each of menus 40 could represent a separate category. In Fig. 23, each of the 
categories is "scalar", in that there are a set of unique selections without subcategories. It 
is also possible to have a "dual scalar" or two-dimensional scalar category. For example, a 
geographical d atabase m ight h ave 1 ongitude and latitude. T riple or higher order scalar 
categories are also possible (e.g.. a geographical database could include altitude). 

Another structure for categories might be a "hierarchical" structure. This 
structure has the form of a tree. For example, the dating database could include a category 
for religion. That category could include a first level of subcategories, such as Christian. 
Jewish, and Moslem. Each of these religions would then be divided into fiirther 
subcategories. 
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For example, the Christian category could be sub-divided into Catholic and 
Protestant, with each of those being further subdivided into different sects. 

In accordance with the present invention, it has been found that better 
search results can be obtained by using a correlation procedure which is different for 
5 different types of data structures, hi creating the structure database 10, each record (e.g., 
the information relating to a single candidate) would be parsed into categories, and the 
database 10 would retain information regarding the structure of each category. Thereafter, 
in determining the relevance of a particular record, the searcher's selections in each 
category would be correlated to the structure of that category in order to arrive at a value 
10 representing the relevance of that category. All of the categories in the record would then 
be processed, for example, by averaging, in order to arrive at a quantity representing the 
relevance of the record. In this manner, a relevance value is obtained for each record. 

As an aid to understanding the relevance determination process, it is 
convenient to characterize categories in terms of a tree structure. For example, a character 
15 with a hierarchical structure could be represented as a multi-level tree as illustrated in 
Figure 28. Here, the category is represented by the top node 60, while the sub-categories 
are represented by the nodes 62a-62b, and the level of information below that is 
represented by the nodes 64a-64d. Similarly, as illustrated in Fig. 29, a scalar category 
could be represented by a tree with a single top node, 70, representing the category and 
one secondary level of nodes 72a-72e representing the sub-categories. Other fonns of data 
structures are possible and could be similarly represented by a tree structure with nodes. 
However, those skilled in the art will appreciate that the invention is not limited to 
categories and sub-categories that can be represented by a tree structure. For example, the 
concepts of the invention are equally applicable to data structures that can be represented 
as a set of scalar values. In the dating site example, a searcher might designate his address 
by latitude and longitude (or street and avenue) in order to locate dating candidates within 
a certain distance. The structure of this date is a multi-dimensional vector. 

Fig. 30 illustrates the process for determining relevance of a category 
having a hierarchical data structure. This involves generating a selection tree TS and a 
data structure tree TD. In each tree, corresponding nodes are similarly numbered. This is 
only necessary to assure consistent treatment of corresponding nodes so that the 
numbering may be somewhat arbitrary, hi the selection tree TS each node has a binary 
weighting next to it. A node which is selected by the searcher is given a weight of 1 and a 
node which is not selected is given a weight of 0. hi the data structure tree, node weights 
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are assigned starting at the lowest level nodes, which are assigned a weight of 1.0, and 
decreasing weights are assigned to each successively higher level of nodes. It is presently 
preferred that each successively higher level of node be provided a weight which is 90% 
of the weight of the next lower level node. Thus, nodes at the second level from the 
5 bottom are assigned a weight of .9, nodes at the third level fiom the bottom, are assigned a 
weight of .81, and so forth. In order to obtain a relevance value for the category 
represented by these trees, corresponding nodes weight values are correlated to arrive at a 
category relevance value. It is presently preferred that for a hierarchical data structure, the 
well known cosine coefficient algorithm be used for relevancy determination. That 
10 algorithm could be represented by the equation 1 : 



|;Di»S 

iml 



Where Ra(S,D) is the relevance value of the category, Di and Si are the 
15 weighting categories assigned to the node i of the trees TD and TS, respectively (the nodes 
are simply processed pair-wise), and N is the total number of nodes. 

Fig. 31 illustrates the preferred process for determining the relevance value 
of a category having a scalar structure. Once again, binary node weights are assigned to 
tree TS based upon whether a node is selected, hi the Tree TD, a weight of 1 .0 is assigned 
20 to the selected sub-node. Progressively lower weights are than assigned to the remaming 
sub-nodes, depending upon their distance from the selected sub-node. It is presently 
preferred that the weight of a sub-node be multiplied by .9 for each position that it is 
removed from the selected sub-node. By assigning weights in this mamier, it is possible to 
attribute value to a sub-node in the database based upon how close it is to the selected 
25 value. Thus, a record in which the selected node does not correspond to the value in the 
record will still be given effect in the relevance determination, depending upon how close 
the value in that record is to that selected value. It has been found that the cosine 
algorithm is unreliable when used with scalar categories, because it eliminates the 
contribution of any unselected node to the relevance value. Accordingly, it is more 
30 desirable to use a relevance algorithm which does not do this. For example, the algorithm 
represented by equation 2 is presently preferred for scalar categories. 
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Once a relevance value has been obtained for each category, these values 
can than be combined, for example by averaging, in order to arrive at a relevance value for 
the entire record. If such averaging is utilized, it is preferred to ignore all unselected 
categories in the evaluation process. 

The process for generating a relevance value for a record is summarized in 
the flow chart of Fig. 32. The process starts at block 100 and, at block 102. the first 
category in the record is selected. At block 104, the relevance algorithm utilized is 
determined, based upon the data structure of the category. In block 106. the weights of the 
respective nodes of the selection tree TS and the data structure tree TD are correlated 
using the selected relevance algorithm. Preferably, the algorithms discussed above are 
utilized. 

At block 108 a test is made to determine whether all categories in the 
record have been processed and, if not, the next unprocessed category is selected at block 
110 and control returns to block 104 to process the next category. If it is detennined at 
block 108 that all categories have been processed, control transfers to block 112. where 
the relevance values of the categories are combined to produce the relevance value of the 
record. Preferably, this is done by averaging, as described above. At this point the 
20 process tenninates, since the relevance value of die record has been determined. 

Having a relevance value for each record, it is now possible to produce a 
report for the searcher, preferably in the order of relevance value. 

Further aspects of the present invention will be described in the context of 
an alternate embodiment, which realizes an improved message board or user forum and 
25 also exemplifies the user reputation aspect of the invention. 

In late 1998, a law firm "Firm J" was losing associates faster than it could 
hire them. To stem the tide. "Firm 7" decided to give all associates a year-end "boom- 
year bonus" of $ 1 5,000. 

At o ther firms, c onfiision r eigned. Law firms h ad 1 ong m ade a p oint o f 
paying associates the same amount from firm to firm. Should all firms now raise salaries 
to match "Firm J"7 Or could they get away with leaving things as they stood? 
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Firms responded inconsistently: some matched the "Firm i" bonus, a few 
increased it, and others paid nothing. Associates who hadn't received bonuses were 
resentful, but there was little they could do. 

The following year, an anonymous associate started a message board on 
Yahoo! called-Greedy Associates." A ssociates using this board hoped that by talking 
about their finns online, they could put pressure on law firm partnere to match "Firm 1" if 
boom-year bonuses were given a second time. 

The logic was that an online message board would create accountability. 
Firms that hadn't matched "Firm 1 " in 1998 thought they could get away with it because 
nobody would know. Law students considering working at those firmswould have no 
way of learning whether, or how much, those fimis had paid. By creating an online 
message board to talk about salaries and bonuses publicly, lawyers could create a 
repository for this kind of information and force their firms to match market leader. 
Fimis that chose not to would be taken to task, and would have a harder time recruiting 
15 new attorneys. 

The Greedy Associates board was wildly popular, receiving up to 80.000 
hits per day. As soon as a firm decided to give (or not to give) a bonus, news went out 
immediately. Associates sometimes learned that they had received bonuses on Greedy 
Associates before receiving an official memo from their firms. Greedy Associates became 
the new grapevine, and before long associates at most firms were checking the board 
several times a day. 

The board made the frontpage o f the New York Times when gossip on 
Greedy Associates led New York law firms to pay large bonuses in order to match 
California firms. In the past, the California raises might have been ignored. But with 
Greedy Associates publicizing the buzz among lawyers, law firms felt they were under a 
microscope. They could no longer ignore what people were saying about them online 
without putting their reputation at risk. 

For the first time lawyers had been given a conduit to exchange 
information, and the information they exchanged was not limited to salaries: firm culture, 
clients, layoffs, and general gossip were all discussed. Before the Internet, this would 
have been impossible. Now it was easy. 

Greedy Associates was popular in spite the incredibly poor quality of its 
underlying technology. "This board sucks." was the message most commonly posted to 
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Greedy Associates. And it did. The fact that Greedy Associates became so popular is a 
testament to the incredible demand for the service, not the quality of the site. 
Three problems stood out: 

• SfiaflL Most messages weren't worth reading. They ranged from long rants 
to advertisements to messages like "Right on!" Users looking for specific infomiation or 
good conversation were forced to wade through huge amounts of spam before finding 
what they wanted. 



) 



• Static Boards . Although there was only one Greedy Associates board, all 
kinds of different people, with different interests, were reading it. California litigators 
were thrown in with New York corporate lawyers; ambulance chasers from Alaska were 
grouped with tax lawyers from Texas. As a result, most users were forced to read 
messages about subjects they weren't interested in. This was just as bad as making them 
read spam. If a message doesn't apply to you and you're not interested in it, it may as well 
be spam. 

Because of the divergent interests of its users, the original Greedy 
Associates board eventually fractured i nto almost fifty separate boards with n ames 1 ike 
Greedy NY Associates, Greedy SF Associates and Greedy NY Tax Associates. Every 
variation on the theme was played. And of course, because they were far smaller than the 
original Greedy Associates board, each subsidiary board was far less useful. 

• No Real Anonymity. One of the chief reasons for the popularity of Greedy 
Associates was the anonymity it offered. By speaking under a pseudonym, people felt 
they could reveal more than if their identity were known. 

But as many people realized, the anonymity offered by Greedy Associates 
was limited. As most sites do. Greedy Associates secretly recorded information about its 
users and would disclose this information if served with a court order or subpoena. As a 
result, people who might otherwise have contributed to the conversation remained silent 
for fear of revealing their identity. 

The present inventor originally set out to solve the problems observed on 
Greedy Associates. Efforts were focused on four discrete issues: 

• Dynamic . Static hoards are clearly prohlematic, yet no message board product 
provides a non-staticsolution. A dynamic product, in which the contours of a 
"board" can expand or contract as users desire, is required. 
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• Self-Regulating . Spam and low-quality messages choke off meaningful 
conversation before it ever has a chance to start. A self-regulating board in 
which messages that users don't want to see vanish before others are forced to 
read them results in less spam and more high-quality dialogue. 

• Anonymous. Valuable information about the intimate details of specific firms 
attracted people to Greedy Associates, but the lack of true anonymity prevented 
the most interesting information from ever being posted. The option of posting 
information with true anonymity Is necessary to give users the freedom to post 
the kind of information that others want to see. 

• Organic. Certain areas of message boards are heavily used and deserve to be 
expanded. Others are rarely used and fall into neglect. A good product should 
be organic: it should respond naturally to the demands that users place on it. 
Areas that are heavily used should automatically expand; areas that are rarely 
used should automatically contract (or even vanish). 

Solutions Have Wide Application. It soon became apparent that the problems 
observed on Greedy Associates are endemic to message boards generally, and that the 
solutions have widespread application to virtually any kind of online community. 

As a result, instead of designing a better version of Greedy Associates, the present 
invention created a process and system to allow Web sites of any kind to implement the 
20 solutions discovered. 
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A. Dynamic Model. 

The Proble ms With Static Models . 

Online speech is stored using static methods. A post might be found on a 
specific "board." an article in a "section" of a magazine, or a photograph as part of an 
"album." These storage models separate content into individual spaces with fixed 
boundaries. People know that messages about Honda Accords, for example, are found on 
the Accord bulletin board in the Honda section, or that messages about Cisco Systems are 
found on the Cisco bulletin board in the Companies section. The path to a specific item is 
30 always the same, and follows a simple categorization scheme. 

This is a bad system. To understand why it is bad, it is useful to understand 
how a static system is structured. Consider Vault.com. a premier message board for job 
seekers. A simplified structural model of Vault.com*s message boards relating to "Law" 
appears in Figure 1. 
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In the system of Figure 1, boards are grouped into three categories: Firms, 
Industries and Topics. This appears logical and would seem to provide a clear framework 
for posting messages. But it doesn't. 

a. Bonuses at "Firm J ": A Simple Example. 

Suppose a user wants to post information about bonuses at "Firm 1". Where 
should he post the message so that others will find it? There are three possibilities: The 
""Firm 1 "" board, the "Law" board or the "Salary Infonnation" board. 

Few users would take the time to post their message to all three relevant boards, 
and if they did it would simply create another problem. People who read all three boards' 
would find themselves reading the same message over and over again. Thus, there is no 
one logical place for a user of the above system to post a message about bonuses at "Firm 
J", and no obvious solution to this problem. 

The lack of a clear answer to the question of where a message on a specific subject 
should go creates difficulties for users. In the above example, users may read the "Firm /" 
board without ever realizing that messages about "Firm 1 "are also posted on both the 
"Law" board and the "Salary Infonnation" board. For these users, the system is under- 
inclusive because it fails to show them all the messages that they want to see. But users 
who look for messages about "Firm 7" bonuses on the "Firm 1" board have the opposite 
problem. These users may be forced to read through numerous messages about "Firm 1" 
that don't deal with bonuses. For these users, the system is over-inclusive because it 
shows them many messages that they don't want to see. 

b. Comparisons Within Groups: A Complex Example. 

Problems with static systems are even greater for users who want to post messages 
about several different subjects within the same group. Suppose, for instance, that a user 
wants to compare the bonus given at "Firm I " with the bonuses given at another specific 
firm (Firm 2"). Where should he post this message? 

There are five boards where this message could reasonably be posted, but none of 
them are precisely right. It could be posted to the "Firm I " board, the ""Firm 2"" board, 
the 

"Firm 3" board, the "Law" board or the "Salary Information" board. Whichever board the 
information is posted to, however, it's virtually certain that many users who would find it 
interesting will never see it. Jn some embodiments, it would not be posted to the "Firm 3" 
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board ( or o ther b oards r esulting f rom t he filter selection o f o ther firms that a re n either 
"Firm I " nor "Firm T). In other embodiments, it would be posted to one or more other 
boards resulting from the filter selection of other firms that are neither 'Firm I " nor "Firm 

ry 

5 Even if the poster feels sure that he should post his message to one of the boards 

grouped under "Firms," there's no clear answer as to which is best. Since no answer is 
clearly correct, any selection is sure to confuse users to some extent. The only board 
which would be clearly correct would be one dedicated specifically to comparisons of 
"Firm 7", "Firm 2" and "Firm 3". And no such board exists. In some embodiments, it 
10 would not be posted to the "Firm 3" board (or other boards resulting firom the filter 
selection of other firms that are neither "Firm I " nor "Firm 2"), In other embodiments, it 
would be posted to one or more other boards resulting firom the filter selection of other 
firms that are neither "Firm I " nor "Firm 2"). 

15 ii. The Advantages of Dynamic Boards . 

The present invention allows companies to create dynamic message boards. Figure 
2 shows one embodiment of a system utilizing the present invention. Other embodiments 
can remove, add to, change, and^or rearrange the shown components. In a dynamic 
system, messages are not situated in individual areas with clear boundaries. No clearly 

20 defined "boards" exist, histead, the user selects filters which the system uses to generate 
"boards" fi-om a message database. Consider how the two problems discussed in the 
previous section would be solved by a dynamic system. 

a. Bonuses at "Firm J ": Solving the Simple Example. 
25 If a company like Vault.com were using the present invention, it might use filter 

categories such as "Firms," "Industries," and "Topics." In some embodiments, the filter 
categories are "hardwired" into the system. In other embodiments, the filters are 
dynamically generated. A user interested in bonuses at "Firm 1" would select the 
following filters: 

JO 

I ¥\ms="Firm V 



Topics=Salary Information 
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Although the user has not selected a filter for Industries, this filter will 
automatically be set to "Law" because "Firm 7" is a law firm. If the user had selected a 
banking firm, the Industries filter would automatically have been set to "Banking." The 
database understands the relationships between filters and fills in unselected filter boxes 
with appropriate information. This understanding can be either "hardwired" into the 
system, or can be dynamically generated. Some examples of relationships generally are 
shown in Figure 3. Thus, even though the user has left Industries blank: 



Industries=Law 



10 

Now t hat t he filters h ave b een s et, t he u ser c licks " Apply." T he s oftware s orts 
through the database and pulls out all messages, articles and other content related to both 
'Tirm 1 " and Salary Information (area A in Figure 4). This infomiation will be displayed 
first, in a format indistinguishable fi-om an ordinary message board. The Present invention 
15 next pulls out all information related to Law Firms (other than "Firm 1") and Salary 
Information (Area B). This information will be displayed next. 

b. Comparisons Within Groups: Solving the Complex Example. 

The advantages to this system become clearer if we reconsider the complex 
20 example, in which the user wanted to post a message comparing bonuses at "Firm V\ 
"Firm 2" and'T/rw 3", In a dynamic system, the user would select the following filters: 



Firms= "Fzrm I" 
"Firm 2" 
"Firm 3" 



Topics=Salary Information 



As in the previous example, the Industries filter will automatically be set to "Law'' 
because the firms selected are all law firms. Thus; 



Industries=Law 



The Present invention will sort through the database and pull out all messages, 
articles and other content related to "Firm 7", "Firm 2"/Firm 3" or Salary Information. 
30 Some embodiments pull out content related to law firm information for law firms that are 
none of "Firm J", "Firm 2", and "Firm 3". Some embodiments pull out content related to 
the law industry. It will then order the data so that the most relevant information will be 
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displayed first. Figure 5 shows an example of selected categories of content and user 
selected categories being used as inputs to generate relevances. 

The first messages to be displayed will be those tagged with "Firm }'\ "Firm 
2'VTirm 3" and Salary Information Oabeled "A" in Figure 6). These messages will be 
most likely to contain the content that the user is looking for. By selecting these filters the 
user has, in effect, created a custom "board" designed specifically for him on precisely the 
subject he is most interested in. 

In one embodiment, a message relating to firm 1, firm 2, and salary is rated higher 
than a message relating to firm 1, firm 2, and firm 3. In another embodhnent, a message 
relating to firm 1, firm 2, and salary is rated lower than a message relating to firm 1, firm 
2, and firm 3. 

The next messages to be displayed will be those labeled "B." The Present 
invention will combine messages about "Firm J" & "Firm 2", "Firm J" &"Firm 3" and 
"Firm 2" &"Firm J" (all of which are also about Salary Information) and will sort them 
using a number of factors. In some embodiments, these factors can include a fiizzy math 
algorithm. In some embodiments, these factors can include an algorithm combining scalar 
values. After these messages have been displayed, the Present invention will display 
messages labeled "C," which deal solely with ''Firm r\ "Firm 2" or "Firm 3" and the 
messages labeled "D," which deal with Salary Information and Law Firms, but not with 
"Firm 1 "Firm 2" or "Firm 3" specifically. In some embodiments, the above order can be 
changed; for example, including messages which do not deal with salary information. 

Allowing users to display messages in this way solves the problem described in the 
last section. Users who wish to pull up information on "Firm "Firm 2", "Firm 3" and 
Salary Information will see, first and foremost, the information most interesting to them. 
If, while looking at this "board," they choose to post a message, their message will 
automatically be tagged with "Firm J", "Firm 2", "Firm 3" and "Salary Information." It 
will be among the messages likely to be displayed when another user performs a search 
using the same filters. 

Unlike a search that uses only Boolean keyword searching, some embodiments of 
the invention allow searches to yield results which may not be 100% on point but still 
have relevance. For example, in an embodiment managing products, a customer can find 
products with varying degrees of relevance to the filters, and not just the 100% relevant 
products. If the merchant does not have one or more of the products sought by the 
customer, at least the merchant can present related products of interest to the customer. 
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In e mbodiments s uch a s t he d iscussed embodiment, a u ser can find i nfonnation 
which may not be 100% on point but still have relevance. 

In order to match data in a database with a given query, we take advantage of 
relationships (also known as "links") that we establish between the data and the query. 
These relationships are often, but not always, segmented across several different 
categories (such as age, height, weight, location, price, etc.). 

Every piece of content in a TRANSPARENSEE™ system is tagged with a set of 
weighted categories. Any query made to the system is also translated into a set of 
weighted categories. Our system assigns a numerical value to the degree of similarity (or 
difference) between these two sets of weighted categories through the use of our 
"Similarity Algorithm". 

The steps of the Similarity Algorithm are as follows: 

1) Determine the weights of an element of content's tagged categories. 

2) Determine the weights of the categories used in the selection (or query). 

3) For each piece of tagged content: 

3a) For each category (such as age, height, weight, location, price, 

etc.). 

3aa) Find the similarity of the content's category weights to 
selection's category weights. 

3b) Aggregate the similarities across all root categories for this piece 
of content. 

The output of this calculation is a mapping of content object to relevance value. 
The Similarity Algorithm can be customized in several ways: 
Step 2) When a selection is passed into the algorithm, the weight on each category 
is either 1 or 0: 1 if the category has been explicitly selected and 0 if it has not. The 
Similarity Algorithm uses the relationships (links) between categories to assign weights to 
categories that are related to the explicitly selected categories. These relationships (links) 
could be sibling relationships, parent/child relationships, cross-linked relationships(links 
to categories under other root categories) or any other type of relationship. Weights 
assigned to categories as links are traversed based on the weight of the originating 
category in the link. The modifier used to assign weights to linked-to categories is 
adjustable. 

Step 3a) If desired, certain root categories can be ignored. 

Step 3aa) The method of comparison between die category weights in the 
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selection and the category weights in the content is customizable. One method of 
comparison that can be used is a Cosine Coefficient algorithm. 

Another method of comparison that can be used is the "Smithgate Algorithm", 
which we developed ourselves. Any other algorithm can be used to determine the degree 
of similarity between two pieces of tagged content. 

Step 3b) The aggregation algorithm can take into account weights or rankings 
of the root categories, since certain root categories may be more importantthan other root 
categories. 



B. Rating Messages 

•Hie dynamic model described in Section A provides a powerful tool for organizing 
content. Used in conjunction with a sophisticated rating system, it is capable of far more. 

A dynamic system automatically captures "metadata" each time a user posts a 
message. Examples of metadata are the filters set when a message is posted and ratings 
information. Because we know which filters are set when a message is posted, we know 
(in broad terms) what the message is about. As users rate messages, the system therefore 
develops a sophisticated profile on which subjects users are experts on. 

This profile allows the system to do two things that can't be done on static 
systems: users can screen content so that people with poor reputations on this subject are 
ignored; and ratings given to specific messages can be weighted by the user's knowledge 
of the subject. 

From a user's perspective things are simple: just point and click to give a message 
a rating between one and seven. Other rating systems use other scales. Some 
embodiments can have discrete and/or continuous rating systems. But the Present 
invention manages to do subtle and complex things with this simple rating. 

i- Reputation System 

Each user builds a reputation over time. This reputation is not a single number, but 
a profile made up of many numbers. Users build reputation ratings for each filter value of 
every message they've ever posted or rated on the system. Figure 7 shows an example 
flow chart for updating a user'srating. Steps can be added, removed, changed, and/or 
rearranged. 
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There are two ways of building a reputation: posting messages and rating 
messages. Posting a message gives the system substantial data to evaluate. Reputations 
gained through posting are therefore difficult to influence once established. In contrast, 
rating a message gives the system limited data to evaluate. Reputations gained by rating 
are therefore easier to influence. Thus, posting allows users to build "strong" reputations 
which can't easily be changed while rating messages allows users to build "weak- 
reputations which can be changed quite easily. 



10 



a. Building a Reputation by Posting: Strong Form. 

Consider an example in which a poster posts a message comparing "Firm l'\ 
"Firm 2"and"Firm 3". For the moment, let's contemplate only the "Films'' filter, which is 
set as follows: 



15 



20 



25 



30 



Finns=Firm J 
Firm 2 
Firms 



In this case our rater, thinking the poster's message brilliant, gives it a 7. Our rater 
has aheady built a reputation, and his reputation for the selected firms is: 



"Firm!'' =7 

"Firm 2" =4 

"Firm 3" =1 

The situation now looks like this: 



(high) 

(medium) 

(low) 



Filter Value 


Rating 


Rater's Reputation 


'Firm J" 


7 


7 


"Firm 2" 


4 


'Firm 3" 


1 



For each filter the rating of seven will be weighted by the rater's reputation and 
then averaged into the poster's reputation. Let's go through this example to see how this 
would work. 

The rater has a reputation of seven for "Firm I". He is an expert on the subject. 
Since an expert on "Firm I" gave a message involving "Firm 1" a top score, the poster's 
reputation on "Firm 1" will go up substantially. The rating of seven will be averaged into 
the poster's reputation on "Firm 1 " and will be heavily weighted. 

The rater has a reputation of four for "Firm 2". This means that, while not entirely 
ignorant, he isn't an expert. Although he gave the message a seven, we shouldn't trust his 
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opinion on "Firm 2" as much as we did his opinion on "Firm 1". The rating of seven will 
be averaged into the poster's reputation for "Firm 2", but will not be weighted as heavily 
as his rating of "Firm 1". The poster's reputation for "Firm 2" will rise, but not as much 
as his reputation for "Firm 1 ". 

5 As for "Firm 3", the rater has a reputation of one. He knows nothing about "Firm 

3", so we shouldn't trust his opinion at all. Even though the rater gave this message a 
seven, the rating will have no weight and will not affect the poster's reputation. In other 
embodiments, the weight has nonzero but low weight. 

) b. BuildingaReputationby Rating: Weak Form. 

Not all users are comfortable posting messages. For this reason, a weak form of 
building reputation that does not depend upon posting is also available. 

All that is required of users to build this type of reputation is that they rate 
messages. Each time a user rates a message, the system performs a "cluster analysis" on 
the rating, hi altemative embodiments, the reputation of the user is adjusted less 
frequently than every time the user posts a message. 

This involves comparing the user's rating with ratings given that message by 
people with high reputations. If. over time, a user's ratings on a particular subject tend to 
correlate with the ratings of high reputation people on the same subject, we can assume 
that the user is trying to rate messages honestly and fairly and that he knows something 
about the subject. His reputation in this area will rise. But if the user's ratings tend to 
disagree with the ratings of people with high reputations, his reputation will fall. 

A reputation built in this way is "weak" in the sense that it may rapidly be changed 
by the strong fonn of reputation-building. For example, a user may build up a reputation 
for ""Firm 1"" over time using the weak method. Eventually this user may decide to post 
a message about "Firm 1". If the message receives a good rating from high-reputation 
users, the user's reputation for knowing about "Firm 1" will be reinforced. But if the 
message receives a bad rating, the user's reputation for knowing about "Firm I" will 
quickly be eroded. One or two bad "strong" ratings of posted messages are enough to 
destroy a "weak" reputation built up over a period of months, other embodiments, more 
than two such messages are enough to destroy the reputation. 



Message Rarin pe 
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Just as users have reputations, messages have ratings. Message ratings are 
determined by the scores useis give them, weighted by the relevant reputation of the 
raters. 

Let us go back to our example of the previous section. The situation was as 
5 follows: 



10 



25 



Filter Value 


Ratios 


Rater's Reputation 


*Tirm 2" 


7 


7 


"Firm 2" 


4 


"Firm 5" 


1 



The rater has given this message a seven. But the rater does not have a perfect 
reputation for all the relevant filters. He knows quite a bit about "Firm 1", but only a litfle 
about "Firm 2" and nothing at all about "Firm 3". 

The system aggregates the rater's reputation in these fields using a mathematical 
fomiula. hi this case, the rater's aggregate reputation for "Firm J", "Firm 2" and "Firm 3" 
is four. The system will average the rating of seven into the message's rating, giving it a 
weighting of four. Figure 8 shows an example of calculating an aggregate reputation. In 
15 other embodiments, nonuniform weights are given to the multiple rater's reputations. In 
one embodiment, the scale of 1-7 is rescaled to 0-1. Other embodiments rescale ratings to 
different continuous or discrete ranges. 

If the user had had a perfect reputation for knowing about "Firm V\ "Firm 2" and 
"Firm 3", the weighting would have been a seven. In that case the user's rating of seven 
20 would have been averaged into the message rating with a weighting of seven. The 
message rating would count twice as much as it did in the prior example. In some 
embodiments, the weight of a message has a linear relationship with the rating of the 
message. In other embodiments, the weight of a message has a nonlinear relationship wiA 
the rating of the message. 

In some embodiments, a message has one rating. In other embodiments, a message 
has multiple ratings, for example different ratings for different filters or sets of filters. 

C. Threshold Filtering. 

The rating system works hand in hand with a system to filter rated messages. The 
30 filtering system allows users to select a rating threshold and view only those m essages 



with ratings above that threshold. Other messages are not 



seen. 
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Method of Threshold Filterinp - 

To set a threshold, users click the "threshold" button. A palette appears, containing 
a scatteiplot as in Figure 9. Other embodiments use an interface other than a scatteiplot, 
such as one or more selectors of reputation and/or message rating. Each dot represents a 
message. In other embodiments, dots represent approximations of messages and do not 
have a one-to-one correspondence. By selecting a point on the scatterplot. users can 
choose any combination of message quality and reputation quality. In some embodiments 
where messages have multiple ratings, such as for different filters, a user can select ratings 
directly or indirectly. Other embodiments permit selection of just reputation or just 
message rating. Suppose, for example, that a user selects the point on the scatterplot as in 
Figure 10. 

By selecting this point the user has chosen to view messages of high message 
quality without much regard to the reputation of the poster. 

A different user might have selected the point on the scatteiplot as in Figure 1 1 . 

This user has chosen to view messages posted by users with high reputations 
without much regard to message quality. Many users will, of course, select a point like in 
Figure 12: 

This user wants to see only those messages of high-quality which were written by 
people with high reputations. By selecting this threshold, this user will likely see only the 
very best messages that have been posted. 

Results of Threshold Filterinpr 

In combination with the reputation system, this method of threshold filtering 
allows people to build communities of self-validating experts. These experts are 
encouraged to post good content and to rate content they see accurately. 

By posting good content or rating content accurately, users build high reputations. 
People with high reputations become community leaders because their voices are heard by 
others. People without high reputations are excluded from the community because their 
voices cannot be heard. 

In the diagram of Figure 13. "Average Threshold" represents the average 
combination of reputation and message rating selected by users of a certain filter-set (such 
as '"'Firm 7"" and "Salary Information"). 
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Other embodiments use an interface other than a scatterplot, such as one or more 

selectors of reputation and/or message rating- 
Users whose quality of speech places them above the average threshold will be 

heard. In this way, the Present invention formalizes a process that takes place informally 

all the time: people who speak intelligently and often become recognized as authorities. 

But the system does something more. Users whose quality of speech places them 

below the average threshold will, on average, not be heard. Their speech is never seen by 

the average user. 

People who say foolish things or post spam will find it difiRcult to post messages 
which fall above the Average Threshold. They will quickly estabUsh a poor reputation. 
Thus, in addition to providing incentives to post good content, the system provides 
disincentives for posting bad information. People are encouraged to say good things and 
discouraged from speaking if they have nothing good to say. 

D. Implicit Reputation. 

Filters often have clear relationships between them. ""Firm 1"," for instance, is a 
law film. Thus, as described in Section ni(A)(ii), when ""Firm 1"" is selected (for 
"Firms") "Law" is automatically selected (for "Industries"). 

This means that as people build reputations in specific categories, they 
automatically build reputations in other related categories. The relationships between 
related categories can be "hardwired" and/or dynamically determined. A person who 
builds a reputation for "Firm 1" simultaneously builds a reputation for Law. See Figure 
14. 

If, after speaking well about Salaries at "Firm 1" a person decides to speak about 
Salaries at Law Firms generally, they will already have established a reputation for both 
"Law" and "Salaries." Their advice on Law Firms will be trusted because, by 
demonstrating that they know about "Firm 1", they've shown that they know about Law 
Firms generally. 

If they say bad things about Law Firms, their reputation for Law Firms will decline 
but their reputation for "Firm 1" will be unaffected. In other embodimems, their 
reputation is affected poorly. In other embodiments, good messages raise their reputation 
for "Firm J". After all, they've already established that they know about "Firm 7". The 
fact that they don't know about other firms doesn't diminish that. 
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E. Annotation System 

The use of filters as described permits a unique annotation system. This system 
lets users annotate content with their comments, which are appropriately tagged and filed 
by the Present invention. In this way proprietary content becomes the seed fi-om which 
thousands of related messages sprout, filling the database with interesting, pre-sorted 
messages. Figure 15 shows an example flow chart of annotation posting. Steps can be 
added, removed, changed, and/or rearranged. 

i. Creating Annotations. 

Proprietary content is first tagged, sentence by sentence, with appropriate filters by 
the site operator. In other embodiments, tagging occurs more fi-equently, for example 
word by word, or group of words. In other embodiments, tagging occurs less firequently, 
such as in multi-sentence blocks or paragraphs. 

As shown in the paragraph of Figure 16, the user cannot see the filter values 
attached to each sentence. These are invisible. All he can see are the sentences about 
"Firm I In other embodiments, the user can see one or more filters. 

The filter values come into play when the user decides to annotate a sentence. 
Suppose that the user decides to conmient on the third sentence in the above paragraph. 
They select the sentence to annotate, then enter their comments, as in Figure 17. 

Smce we know that the sentence being annotated is about John Doe, a partner at 
"Firm r\ we can feel reasonably sure that the annotation is about the same subject. The 
system therefore automatically tags the annotation with the same filters as the original 
sentence and files the annotation in the database using those filters, hi other embodiments, 
filters can be added, changed, and/or subtracted, automatically or by selection. 

ii- Viewing Annotations. 

There are two ways to view annotations: annotation format and message format, 
a. Annotation Format. 

When viewing annotated text, users can select a sentence to view its annotations. 
Thus, a user reading a description of "Firm I" would simply select any sentence for more 
detail. 

Like messages, annotations are rated and filtered. Annotations that fall above a 
user's threshold are displayed. Annotations below the threshold are not seen. 
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Thus by selecting any sentence in a description, a user can immediately read the 
best comments on that sentence. Comments by users with reputations for knowing the 
subject matter are more likely to be seen than comments by less knowledgeable users, and 
good messages are more likely to be seen than bad. 

Since amiotations are filed in the message database, they can also be pulled up as 
messages. 

The annotation in diagram seven, for instance, is tagged with the following filters: 



Firms="Fim7" 



Topics=Hours 
Partners 



Partners=John Doe 



30 



This annotation will therefore come up as a message whenever a user se ts their 
filters in a way that substantially overlaps with these filters. Thus, if a user sets their 
filters to -"Firm 1"" and "Hours." this message is likely to be displayed. It would also be 
displayed during a search for ""Firm and "Partners" or "Law Firms" and "Partners." 
And it's almost certain to be displayed in a search for "John Doe." 

For both annotations and other messages, the order in which they are displayed can 
be influenced by relevance and/or rating. 

F. Since annotations can also be viewed as messages, persuading users to annotate 
content will seed the system with initial messages and get conversations 
started. As long as the site starts with content users want to respond to. 
discussions will be started and placed into the system with enough filters' 
attached so that appropriate messages appear during any related search. 
Because each message will have many filters attached, users will perceive the 
boards on the system to be fiill even though only a few messages may have 
been posted. 

Anonymity provides a powerfiil incentive to speak about sensitive subjects online. 
Indeed, the mere perception of anonymity felt by online speakers has contributed to an 
enormous outpouring of gossip on the Web. But as Time Magazine reports: 



wo 2004/044705 PCT/US2003/036045 

28 

Although the sites give their posters - who generally use pseudonyms - a 
feeling of anonymity, they're usually not anonymous at all. Faced with a 
subpoena, most sites will readily divulge a poster's name to the authorities. 

5 Although a lack of anonymity can create a chilling effect on speech, giving users 

anonymity causes other problems. Anonymous speakers are not accountable for their 
speech and feel free to post spam and low-quality messages because speech can't be traced 
back to them. 

The Present invention's rating and filtering systems solve these problems by 
10 creating accountability for anonymous speech. Users who speak poorly or spam the 
system will receive low ratings. Their messages will not be seen and they will discover 
that their speech has become invisible to others. On the other hand, users with good 
reputations will be able to speak anonymously with the knowledge that their speech will 
be heard, although their names remain unknown. 
15 The Present invention protects people's identity in two ways: its four levels of 

anonymity and its use of onion routing. 



i. Four Levels of Anonvmity. 

The Present invention provides four different levels of anonymity. Users can 
20 change their anonymity level before posting messages in order to ensure that sensitive 
messages receive as much protection as they deserve. Figure 18 shows an example flow 
chart of posting at different levels of anonymity. Steps can be added, changed, removed, 
and/or rearranged. Figure 19 summarizes key features of different levels of anonymity. 
Levels can be added, removed, or changed. 

25 

a. Level One: Use of Pseudonyms. 

First level anonymity allows users to post messages using a pseudonym. Unlike 
other message boards, the software does not ask for information about the user that could 
link the message to their true identity. No e-mail address, credit card information or other 
30 information that could connect a user to the site is recorded. Information about a user's 
Internet service provider or IP address is not logged. All that the system requests from a 
user - and all it knows about a user - is their usemame and password. 

This means that if a site using the Present invention is subpoenaed to turn over the 
identity of someone who posted a particular message, it can't. Even if site operators 
35 cooperate to the best of their ability, the limited infonnation they have will be useless. 
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Asked who posted a particular message, the most they will be able to say is, "that message 
was posted by a person calling themselves 'Daffodil.' It was read by other users because 
Daffodil has a good reputation for knowing about the subject." 

5 b. Level Two: Anonymous Linked. 

Second level anonymity allows users to post messages as "Anonymous." Although 
other users cannot tell who posted an anonymous message, the Present invention keeps 
track and continues to link a user's reputationto themessages they post. A nonymous 
messages may therefore benefit from a poster's high reputation, and ratings given to 
10 anonymously posted messages affect the poster's reputation. 

Messages posted using level-two anonymity are sometimes called "anonymous 
linked" messages because although the identity of the poster is hidden to other users, the 
Present invention keeps track of links between messages and their authors. The software 
"knows" who wrote which message, although other users don't. 

This makes the "private reply" possible. Suppose Daffodil decides to post a 
message critical of 'Mr. Big.' a partner at "Firm 1". Daffodil has posted messages about 
"Firm 1" before, and has a high reputation for knowing about the firm. She realizes, 
however, that readers will be able to determine her identity if they read this message in the' 
context of other messages she's written. 

For this reason Daffodil decides to post her message anonymously. Her high 
reputation for knowing about "Firm 1" is linked to the message, so many people will read 
it. And if they give it a high rating, her reputation for "Firm 1 " will go up even further. 

Suppose Mr. Big reads the message. He disagrees with Daffodil, but doesn't want 
to speak out publicly. He can click a button on the message marked "private reply" and 
send a private reply to Daffodil's internal mailbox on the system. He can send this reply 
to Daffodil even though he doesn't know that "Daffodil" is the person he's writing to. 
And if Daffodil replies to him, she can choose to do so anonymously. If this 
correspondence continues a private, detailed e-mail conversation can take place between 
these two without ever risking Daffodil's identity. And if she eventually becomes 
30 convinced that she wronged Mr. Big in her original message, she may decide to retract it. 

What if Daffodil decides not to retract her message? Mr. Big may become upset 
enough to serve the site with a subpoena demanding Daffodil's true identity. 

The most the site could give Mr. Big would be Daffodil's usemame. But even this 
might be enough to unmask Daffodil. By putting her message together with other 
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messages posted by Daffodil in the past. Mr. Big may be able to determine Daffodil's true 
identity. 



c. Level Three: Anonymous Unlinked. 

For this reason, the Present invention offers a third level of anonymity. Level three 
messages are also referred to as "anonymous unlinked." Like level two messages, they are 
posted under the usemame "Anonymous." But unlike level two. the system does not keep 
track of links between messages and their authors. When a message is posted, the system 
immediately stamps the message with a user's relevant reputation scores; it then severs the 
link between the user and the message and "forgets" the poster's identity. After a level 
three message has been posted, even the site operator is unable to determine who the 
author was. 

Because the message has been stamped with the reputation values of the poster, it 
can be filtered like any other. Messages posted by high reputation users will be seen md 
those posted by low reputation users will not. But users feel secure posting level three 
messages because they know that although their messages can benefit from their 
reputation scores, their identities are completely protected - even from the site operators 
themselves. 



2° d. Level Four: Complete Anonymity. 

For each of levels one. two and three, users are required to log on with a usemame 
and password before posting messages. Although their identities are protected, some users 
may feel uncomfortable providing even this limited information just prior to posting 
particularly sensitive messages. For this reason level four anonymity allows users to post 
25 messages without even logging in. Users are not required to give any information at all. 
Since they have not given any information to the system, and since the Present invention 
does not record IP addresses, infomiation about ISPs or place cookies on a user's machine, 
users can be assured of complete anonymity when using level four anonymity. 

A disadvantage to level four anonymity is that since the system doesn't know who 
30 the user is. they are unable to take advantage of their reputation. As a result, few people 
are likely to see messages posted using level four anonymity. This problem is not 
insurmountable, however. A user who posts a particularly interesting message using level 
four anonymity can simply log in at a later date, find their message, and give it a high 
rating (or. if they're to scared to risk themselves this way. they can tell a friend about the 
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message they "read" and give them enough information to easily locate it). O ne good 
rating will not be sufScient to ensure that the message is widely read. But it will give the 
message enough of a boost that a few more people will see it. If the m essage is truly 
interesting and deserves to be read, it's rating will quickly soar and it will be injected into 
5 the mainstream of conversation. 

Employers sometimes keep track of the sites their employees have been to. As a 
result, people are often afraid to access particular sites from work. 

Figure 20 shows an example of onion routing. The present invention avoids this 
problem through the use of packet wrapping. By using another site as a proxy server and 
10 -Vrapping" our IP packets with theirs, we can disguise the source of our packets. If we 
have a partnership with Yahoo.', for instance, we could route our signal through Yahoo!, 
which would cause employers to believe that their employees are using that site, not ours. ' 

Since filters are used to organize TRANSPARENSEE™ sites, it is important to 
ensure that sites have complete and current filter-sets. But it is difficult and expensive for 
sites to keep their filters up to date in real-time. This would require sites about law firms 
to know the name of every new law firm, and sites about restaurants to know the name of 
every new restaurant, as soon as they come into existence. 

An easier way is to give users the ability to add new filters. If the user of a job site 
doesn't see their firm listed, or the user of a restaurant site doesn't see a new bistro, they 
can add it to the filter set. Allowing users who know a subject best to find and repair weak 
spots in the system is the best and most cost-effective way to keep filters current. 

The potential disadvantage is that some users may insert incorrect filters into the 
filter-set. This can be prevented with TRANSPARENSEEtm's reputation system. 

Suppose a user notices that their law firm, "Firm 4" is not listed on a 
TRANSPARENSEE™ job site. The user would request that Firm 4 be added to the filter 
set and would fill out a form containing basic information about the firm. 

Since the user claims that Firm 4 is a New York law firm, it stands to reason that 
users with high reputations for "New York" and "Law Finn" will be in a position to know 
whether Firm 4 is real or not. The next time such users log into the system, they will see a 
30 poll in the comer of their screen asking: 
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Which of these is a New York law firm? 
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iSl Simpson, Thatcher & Bartlett 

□ Dewey Cheatem & Howe 

□ Bwahahahahaha :-) 



Users with high reputations for *Wew York" and "law firms" might be expected to 
answer this question correctly. But some may not. A malicious minority of users may 
check the wrong box. 

5 These users can be caught through cluster analysis. A simple algorithm allows us 

to determme what answers the majority of users gave and highlights those user, whose 
answers differed substantially. Tl^eir entries can be disregarded and their reputations 
dimmished. If their reputations go down enough, they will no longer be asked to answer 
polls of this type, m this way the filter-set can grow in response to the needs of users 

It can also shrink. If users fail to use certain filters over a period of time, those 
filters are removed fi-om the filter-set 

We term the ability to grow and shrink in response to user demand an "organic- 
element. The present invention makes a system highly organic. The filter-set, and thus 
the board .tself, responds to the demands of high-reputation users. By responding to users 
15 m real-tame and shaping itself to their needs, the system collects and verifies infonnation 
more rapidly and accurately than even a large staff could. Figure 21 shows an example of 
detem^mmg a discussion rating based on multiple factors. Fewer, more, and/or different 
factor, can be used. Such factors can also be used to rate filters and other features of the 
software. 

20 In addition to messages, the Present invention supports polls, articles, transcripts 

faxes. Word files, photos, audio and video clips and any other type of data. These types of 
content can beposted to the system, indexed, searched for. filtered and rated, just like 
messages. 

Posting an interesting fax. photo or Word file would result in a substantial boost to 
25 a user's reputation. Indeed, certain types of content are more likely to result in a 
reputation boost than others. If a user posts an internal memo about bonuses at -Firm 1" 
to the ->Firm 1- and "Salary Infonnation" board, his reputation in those areas will 
skyrocket. It will be clear to everyone using the board that this person works at "Firm 1 " 
and IS doing his best to feed good infonnation to others. This effect creates a strong 
30 incentive for people to post infonnation proving that they are "insiders " 
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Polls can only be posted to the system by high-reputation users. At the discretion 
of the poster, they may be seen only by other high-reputation users. 

Thus, a user with a high reputation for -'Firm I- could create a poll asking other 
F^nn /"people questions about the finn. After all. who better to know the best questions 
to ask than someone who works there? People with high reputations for "Firm 1 " might 
see the following poll the next time they login: 



Dafloaii asKs: "What's the worst thiiij"^ 
Firm 1 's new offices? 

□ not enough closet space 

D horrible shag carpeting 
D other 



When enough people have answered this poll. Daffodil will have the option of 

aJlowing others to see poll results. 

Allowing only users „i,h high rq,„u.d«,s ,„ pos, and am»er polls gives p«>ple a 
subs»„i.l „cen,.ve u. ,ry ,o ob.ain a high reputation. Giving them the <Uso«,ion to send 
such po.|s only to other high reputation users provides a way fcr high ,q,„.ati.n users ,„ 
comnrunrcate only among themselves, thus enhancing the prestige - and r.™, - of 

having a high reputation. 

Users who aehievc a high reputation may also publish articles. An arade is more 
complex than a message, and can contain images (such as graphs, am, other complex 
attachmews. More importantly, an article is posted in a prominor. am. fixed po^«on on a 
page, making users more likely to read articles than messages. 

As ^a, polls, allowing only „se„, with high reputations to write articles enhances 
people s desire ■„ obtain a high reputation. Since people raise the,, reputation by posting 
Sood contort to the site, this encourages the posting of interesting content 

The Presort invention has a "char option, but with a difference. Any user party to 
a ch« can choose to push the -reco„» button at any time. ,f a cha, is being recorded, a 
md hght appears in a comer of .he cha, window. Recorded chats can be posted to the 
system just hke messages. 
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Chats may be restricted to only high reputation users. Other users won't even be 
aware that a chat is taking place. Furthermore, when a chat is posted, it may take on the 
average reputation values of the users party to the chat. This encourages usere to invite 
only high reputation people to chat with them if they want their transcripts to be widely 



seen. 



Pending consideration of copyright-related concerns, the software can easily be 
modified to accept faxes. If this function is implemented, users will be able to fax 
documents to TRANSPARENSEE™ sites fiom any location. After the fax goes through, 
the user's fax machine will print a slip containing a confirmation number. 

The next time the user goes to the site they can receive the fax that they sent by 
clicking the "Receive Fax" button and entering the confirmation number. The fax will 
then appear on the user's screen and can be posted to the system. It is not necessary to 
login to receive a fax, and faxes can be posted to the system using any level of anonymity. 

Again, pending consideration of copyright-related concerns, the software can be 
modified to accept Word files, photos, and video clips. Just as posting a fax can 
demonstrate one's insider status and raise one's reputation, so can posting an interesting 
file, photo, or clip. 

One of the greatest advantages of the Present invention lies in the filter selection 
mechanism. It feeds information to users as they make choices, allowing them to extract 
information fmm the database on areas they may know little about. 

Consider a law student trying to decide which firms to interview with. The student 
knows nothing about law finns, but knows that he would like to work at a firm with 
offices in New York, Palo Alto and London. 

To obtain information, the student would set his filters as follows: 



I Industries=Law 



Locations=New York 
Palo Alto 

i_ London 

If h e now selects the "Firms" filter, the system will show him a list of law fimis with 
offices in New York, Palo Alto and London. The list might look like this: 



Firms=Brobeck, Phleger 
Coudert Brothers 
Davis Polk 
Gibson Dunn 
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Morrison & Foerster 
Shearman & Steriing 
Skadden Arps 
White & Case 



These are all law firms with offices in New York, Palo Alto and London. On a 
conventional bulletin-board system the user would have had to determine for himself 
which finns have offices in all three locations. This could take hours, but only after doing 
this research would he know which boards are of interest to him. On a 
TRANSPARENSEETM system, the relevant firms are pre-selected. 

J. Wireless Clients Supported 

The Present invention has been built to accommodate multiple front-ends. Thus, as 
wireless PDAs (such as Palm Pilots and Blackberries) become more commonly available, 
a front-end can be provided to make TRANSPARENSEE™ sites accessible from such 
devices. 

The present invention will be particularly useful for PDA users, since the small 
screen and low bandwidth of PDAs places a premium on the ability to retrieve high quality 
15 information quickly. Discussion boards, in their current form, will be virtually 
inaccessible from PDAs due to the amount of time it takes to find worthwhile information 
on them, even when using a high bandwidth client. By eliminating low quality 
information from such boards, the Present invention will make PDAs a viable device for 
the exchange of information between large numbers of online users. 

Although preferred embodiments of the invention have been disclosed for 
illustrative purposes, those skilled in the art will appreciate that many additions, 
modifications and substitutions are possible without departing from the scope and spirit of 
the invention as defined by the accompanying claims. 
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I Claim: 



1. A method for searching a content database stored in computer storage, 
the database including a plurality of records each containing multiple fields of 
information, the method comprising the steps of: 

maintaining a structure database in computer storage in which each record 
is parsed into a plurality of categories containing multiple fields of information, each 
category having at least one sub-category, the sub-categories being structure in at least 
one sub-level based upon a relationship between the infomiation in sub-categories, the 
stmcture database containing category structure information defining the data structure of 
each category; 

receiving a search query containing a subset of the categories contained in a 
record, which subset may include all of the categories, the query containing selection 
mfomiation indicating those sub-categories that should be present in recoixls being sought 
by the query; 

for categories present in the query subset, performing a correlation between 
the selection information and information fi-om the stmcture database representing the sub- 
category structure in a subset of the records, which subset may contain all of the records, 
to produce a relevance value for a record; and 

as a response to the query, selecting information from records in the content 
database based upon the relevance value of the records. 

2. The method of claim 1 wherein the correlation step is perfomied by 
correlating selection information for a chosen category with information from the structure 
database representing the sub-category structure of the same category in a subset of the 
records, which subset may contain all of the records, to produce a relevance value for the 
chosen category in each record of the subset, this correlating step being performed for a 
plurality of chosen categories; and, for a record, combining the relevance values of the 
plurality of chosen categories to produce a relevance value for the record. 

3. The method of any preceding claim wherein the conelating step is 
different for two categories that have different data stmctures. 
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4. The method of any p receding c laim wherein performing a c orrelation 
includes assigning a weight to a sub-category in a record depending on whether that sub- 
category is present in the selection information, and using the assigned weight in the 
correlating step. 



5. The method of any p receding c laim wherein performing a c oirelation 
includes assigning a weight to a sub-category in a record depending upon its level, and 
using the assigned weight in the correlating step. 

6. A system for searching a content database stored in computer storage, 
the database including a plurality of records each containing multiple fields of 
information, the method comprising: 

a structure database in computer storage in which each record is parsed into 
a plurality of categories containing multiple fields of information, each category having at 
least one sub-category, the sub-categories being structured in at least one sub-level based 
upon a relationship between the information in sub-categories, the structure database 
containing category structure infomiation defining the data structure of each category; 

a receiver for a search query containing a subset of the categories contained 
in a record, which subset may include all of the categories, the query containing selection 
information indicating those sub-categories that should be present in records being sought 
by the query; 

a correlation device set performing a correlation between the selection 
information, for categories present in the query subset, and information from the structure 
database representing the sub-category structure in a subset of the records, which subset 
may contain all of the records, to produce a relevance value for a record; and 

a unit responding to the query by selecting and providing information from 
records in the content database based upon the relevance value of the reconls. 

7. The system of claim 6 wherein the correlation device coirelates selection 
information for a chosen category with information from the structure database 
representing the sub-category structure of the same category in a subset of the records, 
which subset may contain all of the records, to produce a relevance value for a chosen 
category in each record of the subset, this correlation being performed for a plurality of 
chosen categories; and, a correlation device including a component combining the 
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relevance values of the plurality of chosen categories to produce a relevance value for the 
record. 



8. The system of claim 6 or 7 wherein the correlation device is performs a 
different operation for two categories that have different data structures. 

9. The system of any of claims 6-8 wherein the correlation device includes 
a component which assigns a weight to a sub-category in a record depending on whether 
that sub-category is present in the selection information, and the correlation unit uses the 
assigned weight in performing the correlation. 

10. The system of any of claims 6-9 wherein the correlation unit includes a 
component which assigns a weight to a sub-category in a record depending upon its level, 
and the correlation unit uses the assigned weight in performing the correlation. 

11. The system of any of claims 6-10 provided with access to a network, 
the content database being accessible from the network, the receiver and responding unit 
communicating over the network. 

12. The system of claim 11 wherein the content database is accessed 
through the network. 

13. In an online user forum of the type permitting communication among a 
plurality of users and also permitting users to post information content for access by users, 
the improvement comprising a reputation module storing a reputation rating for a user in 
association with information content, a user's reputation being a function of the degree his 
participation in the forum. 

14. The forum of claim 13 wherein the reputation module is constructed so 
that a first user's reputation rating is dependent upon the evaluation by other users of the 
information posted by the first user. 
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15. The forum of claim 13 or 14 wherein the reputation module is 
constructed so that a first user's reputation rating is dependent upon his evaluation of 
information posted by other users. 

16. The forum of claim 14 or 15 wherein the reputation module is 
constructed so that a first user's reputation rating is dependent more the evaluation by 
other users of the information posted by the first user than upon the first user's evaluation 
of information posted by other users. 

17. In combination with the forum of any one of claims 13-16, a system for 
searching a content database stored in computer storage, the database including a plurality 
of records each containing multiple fields of information, the method comprising: 

a structure database in computer storage in which each record is parsed into 
a plurality of categories containing multiple fields of information, each category having at 
least one sub-category, the sub-categories being structured in at least one sub-level based 
upon a relationship between the information in sub-categories, the structure database 
containing category stracture information defining the data structure of each category; 

a receiver for a search query containing a subset of the categories contained 
in a record, which subset may include all of the categories, the query containing selection 
information indicating those sub-categories that should be present in records being sought 
by the query; 

a correlation device set performing a correlation between the selection 
information, for categories present in the query subset, and information fi-om the structure 
database representing the sub-category structure in a subset of the records, which subset 
may contain all of the records, to produce a relevance value for a record; and 

a unit responding to the query by selecting and providing information from 
records in the content database based upon the relevance value of the records. 

18. The combination of claim 17 wherein the correlation device correlates 
selection information for a chosen category with information from the structure database 
representing the sub-category structure of the same category in a subset of the records, 
which subset may contain all of the records, to produce a relevance value for a chosen 
category in each record of the subset, this correlation being performed for a plurality of 
chosen categories; and. a correlation device including a component combining the 
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relevance values of the plurality of chosen categories to produce a relevance value for the 
record. 



19. The combination of claim 17 or 18 wherein the correlation device is 
performs a different operation for tv^o categories that have different data structures. 

20. The combination of any of claims 17-19 wherein the correlation device 
includes a component which assigns a weight to a sub-category in a record depending on 
whether that sub-category is present in the selection information, and the correlation unit 
uses the assigned weight in perforaiing the correlation. 

21. The combination of any of claims 17-20 wherein the correlation unit 
includes a component which assigns a weight to a sub-category in a record depending 
upon its level, and the correlation unit uses the assigned weight in performing the 
correlation. 

22. T he c ombination of a ny o f c laims 1 2-2 1 p rovided w ith access t o a 
network, the content database being accessible from the network, the receiver and 
responding unit communicating over the network. 

23. The system of claim 22 wherein the content database is accessed 
through the network. 

24. A method for improving an online user forum of the type permitting 
communication among a plurality of users and also permitting users to post infomiation 
content for access by users, the method comprising maintaining a reputation rating for a 
user in association with information content, a user's reputation being a function of the 
degree of his participation in the forum. 

25. The method of claim 24 wherein a first user's reputation rating is 
dependent upon the evaluation by other users of information posted by the first user. 

26. The method of claim 24 or 25 wherein a first user's reputation rating is 
dependent upon his evaluation of information posted by other users. 
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27. The method of anyone of claims 24-26 wherein a first user's reputation 
rating is dependent more the evaluation by other users of information posted by the first 
user than upon the first user's evaluation of information posted by other users. 



wo 2004/044705 



PCT/US2003/036045 




FIG.l 



1/31 



• • • 



wo 2004/044705 



PCT/US2003/03604S 



I Architectural Overview 



Content Indexing 
Service 



Web Frontend 



User 
Manager 




Content 
Manager 






Reputation 
Manager 




Rating j 
Manager J 



Calculation Engine 




Store 



Admin 
Frontend 



Category 
Manager 




Update Service 



FIG, 2 

2/31 



wo 2004/044705 



PCT/US2003/036045 




FIG. 3 



3/31 




wo 2004/044705 

PCT/US2003/036045 




FIG. 4 



4/31 



wo 2004/044705 



PCT/US2003/036045 



(Retevance Calculation 





Rdlavanco 
Calculator 



Relevance 
I0.1J 



Legend 



O 

' selected category 
■ root category 

— - parent-child category link 

— -cfossrootcategofylifik 



FIG. 5 



5/31 




FIG. 6 



6/31 



wo 2004/044705 



PCT/US2003/036045 



Content RaUng 



Improw* ratac*« 





ter*a reputation 




mlani rntln^ 




COnipar* >oilng wim 
otitor mtttni* raunoB 




tar to ocTkM' ratM 


n 





Upd!at» fx>«t*r*a 



FIG. 7 



7/31 



wo 2004/044705 



PCT/US2003/036045 



Aggregate Reputation Calculation 
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•Aggregate Reputation 
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Aggregate Reputation — 
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Post Annotation 



Userselecls 
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new annotation for 



Portion of 
selected 



AnnolBlIon 
contents 



Annotation posted to system 
as new content that is also 
Onked to Ihe sdecled portion 
of ihe original eonteot 



FIG. 15 



15/31 



wo 2004/044705 



PCT/US2003/036045 



^^h'tnl" associatesget treiT^^ 

responsibility and the chance to work with 

prestigious clients from Time-Warner to the New 





Firm: ''Firm J*' 
Topic: Hours 

Compensation 
Prestige 



Firm: "Firm J" 
Topic: Clients 
Responsibility 



Firm: "Firm J" 
Location: New York, NY 
Topic: Hours 

Partners 
Partner: John Doe 
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If you don't mind long hours and an antiquated, 
iockstep compensation system, there is no 
belter place to start your legal career than "Firm 
1 . Firm 1" associates get tremendous 
responsibility and the chance to work with > 
prestigious cIlentsfromTlme-Warner to the New 


since the annotated 
sentence is tagged with 
"Firm 1", "Hours" 
"Partners", and "John 
Doe"...^ 


That's not true.! IVe worked for John, and that 
reputation Is undeserved. In fact, John's 
associates billed an average of 2300 hours each L 
last year - the lowest in the firmi ^ 


...the annotation is tagged 
with the same filter 
values. 
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Posting Content at Different Anonymity Levels 
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Anonymity Level Properties • Posting Content 
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^Onion Routing 
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Aggregate Discussion Rating Calculation 
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messages 
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discussion 
rating 



FIG. 21 



21/31 




22/31 



• ■ m 



wo 2004/044705 



PCT/US2003/036045 





2? 



Multiple Choice 

Gender 



Height 



Weight 



Eye vision 



Eye color 



Hair color 



Race 



Location 



Children 



Children at 
home 




Under 5ft 
5ft.5ft2 
5ft2-5ft4 
5ft4-5ft6 



Under 100 lbs 
100- 110 lbs 
110- 125 lbs 
125 - 140 lbs 



No eyewear 
Reading Glasses [ 
Great Glasses 
Cool Contacts 



Blue 


m. 


Green 


IS 


Hazel 




Brown 


0 



Blonde 
Dark Blonde [ 
Brown 

Brunette 



White-Caucasian i 
Asian 

African Descent 
Hispanic-Latino 



North-East US 
East-Coast US 
North-West US 
Central US 



Age 



Marital status 



Education 



Occupation 



Salary 



Hair length 



.Body type 
(ladles) 



Body type 
(gentlemen) 




Smoke 



Drink 



18-20 
21-25 
26-30 
31-35 



Never Married |g 
DIvonced 
Widowed I 
Involved ' { 



Self-taught 
High School 
Technical -Training I 
Bible Coirefle 



Executive 
Adminlstrath^e 
Financial 
Government 



Not listed 
Under 15K 
15-25K 
25-40K 



Long curfy/wavy 
Long straight 
Medium curly/wavy | 
Medium straight 




Thin - Lean 
Toned -Athletic! 
Muscular 
Average 



Yes 
No 

On occasion 
Trying to quit 



Never 


m 


Rarely 




Socially 




Regularly 
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3 



CO 






lis 


CO 


IIS 


1 Salary 1 




• ^ 

in o 


Over 
lOOK 


' Bye color! 


Blue 


Blue 


Blue 


Occupation 1 




Professional 


Legal 
Services 


Eye vision 




Reading 
Glasses 


No 

eyewear 


Education 


Masters 


Masters 


Masters 


Weight 


* 

*H Pi w 


110- 

125 

lbs 


o in ,^ 
W ^ £ 


tatus 








Marital si 


Never 
Married 


Never 
Married 


Never 
Mamed 


Height 


in m 


in m 


in in 


Age 


m CO 


• 

n fo 


^ m 


l[Gender| 


Female 


Female 


Female 


S 


Selection 


1 
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Enhanced Search results 

Selection: Welght=[ 110-125 lbs / 50-57 kg] Education^ (Masters] Gender^ [Female] Eye color«>[Blue] Helght«r5ft2 • 
5ft4/ 1.57m-1.63m] Hair colors [Blonde] Age==[31-35] Marital status=[Slngle-Never Married] 



Time taken: 3.065 seconds 
Page: M 



Displaying 1-20 of 115401 users 



Id 


Gender 


Age 


Height 


Marital status 


Weight 


Education 


Eye vision 


Occupation 


Eye color 


Sa 


Selection 


Female 


31- 
35 


5ft2 - 
5ft4 


Never 
Married 


110 - 

125 

lbs 


Macf'prc 






Dlue 




bobou 


Female 


31- 
35 


5ft2 - 
5ft4 


Never 
Married 


110 - 

125 
lbs 


Masters 


No 

eyewear 


Legal aervices 


Blue 


ov 
10* 


tieldildtcb 


Female 


31- 
35 


5ft2- 
5ft4 


Never 
Married 


110 - 

125 

lbs 


Masters 


Glasses 


r 1 uitfSSIOnol 


Blue 

> 


25 


LANDDECKER 


Female 


31- 
35 


5ft2- 
5ft4 


Never 
Married 


125- 

140 

lbs 


Masters 


fcool 
Contacts 


Financial 


Blue 


No 
yot 
bu 


eacb 


Female 


31- 

35 


5ft2- 
5ft4 


Never 
Married 


125- 

140 

lbs 


Masters 


No 

eyewear 


Professional 


Blue 


40 


Julie 


Female 


31- 
35 


5ft2 - 
5ft4 


Never 
Married 


125- 

140 

lbs 


Masters 


No 

eyewear 


Professional 


Blue 


40 


Helena 


Female 


31- 
35 


5ft2- 
5ft4 


Never 
Married 


100- 

110 

lbs 


Masters 


No 

eyewear 


Executive 


Blue 


No 

yoi 

but 


tooky 


Female 


26- 
30 


5ft2- 
5ft4 


Never 
Married 


110- 

125 

lbs 


Masters 


Great 
Glasses 


Financial 


Blue 


40 


Isdd 


Female 


26- 
30 


5ft2- 
5ft4 


Never 
Married 


110- 
125 

lbs 


Masters 


No 

eyewear 


Musician 


Blue 


No 
list 




Female 


36- 
40 


5ft2- 
5ft4 


Never 
Married 


110- 

125 

lbs 


Masters 


Cool 

Contacts 


Teacher 


Blue 


25 


freedom | 


Female 


26- 
30 


5ft2- 
5ft4 


Never 
Married 


110- 

125 

lbs 


Masters 


No 

eyewear 


Unsure 


Blue 


25 


BondHwro II Female |j 26- ||5ft2 - || 


Never ||^^**' | 


Masters j|No [[^ 


Executive || 


Blue ^[^ 


Pel 
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jao 


5ft4 


parried 




1 


eyewear 11 




1 


catscratch 




r 


5fk4 


Married 


i 110- 
125 
lbs 


Masters 


No 

eyewear 




Blue 


No 

list 


blondcchick 


Female 


hi- 
ps 


5ft2- 
5ft4 


Never 
Married 


110- 

125 

lbs 


College * 
University 


Great 
Glasses 


Legal Services 


Blue 


No 

list 


Kay 


Female 


bi- 
as 


5ft2 - 
5ft4 


Never 
Married 


110- 

125 

lbs 


College - 
UnlversltY 


No 

eyewear 


Professional 


Blue 


15 


anvdavnow 


Female 


al- 
as 


5ft2- 
5ft4 


Never 
Married 


110- 

125 

fbs 


College - 
University 


No 

eyewear 


Technical 


Blue 


No 
bu 


hopethall 


Female 


al- 
as 


5ft2- 
Sft4 


Never 
Married 


110- 

125 

lbs 


College - 
University 


Great 
Glasses 


Administrative 


Blue 


No 
list 


sarasotaboiinri 


Female 


31- 

as 


5ft2- 
5ft4 


Never 
Married 


110- 

125 

lbs 


College - 
University 


Cool 

Contacts 


Rnancial 


Blue 






No 

yoi 

bu: 


Colleen 


Female 


al- 
as 


5ft2- 
5ft4 


Never 
Married 


110- 

125 

lbs 


College - 
University 


Cool 

Contacts 


Executive 


Blue 


60 




Female 


al- 
as 


5ft2- 
5ft4 


Never 
Married 


110- 

125 

lbs 


College - 
University 


No 

eyewear 


Sales 


Blue 1 


40 


shell 


Female 


al- 
as 


5ft4- 
5ft6 


Never 
Married 


110- 

125 

lbs 


Masters 


No 

eyewear 


Medical 


Blue 1 


40 
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Vitals : Landk 
Create date 2 
Rrst name 
Address 

aty 

Country 
Birth month 
Birth year 
Heard from text 



002-02-10 



San Bernardino 
United States 
1 



Create time 


19:52:32 


Last name 




Apt 




Province 


Callfbmfa 


Zip code 


92423 


Birth day 


21 


Heard firom 


28 




Long Answera 

What are some of your personal goals In life? 



What would be the perfect setting for a first 
mee^ng with a new friend or date? Describe 

Describe your Christian faith. Where are you 
at and what are your goals? What Scripture 
verses are Important to you? 

Describe a little of your personality and 
character traits. (Are you fiinny, laid back, 
more serious, open, shy, etc.?) 

Describe your current occupation. Do you like 
what you do? What is your dream Job? 

Where were you bom? Have you ever 
traveled? If so, where? What Is your Ideal 
adventure? 

What are your fiavorlte acUvlUes? (Including 
sports, leisure, arttsti^muslcal, etc) 

Where did you go to school (I.e. high school 
or college/university, etc.)? What did you 
study? Old you like it? Would you like to do 
more? 

Describe the type of relationship you are 
looking for. What qualities would you like In 
this relationship or person? 

What caused your last relattonship to end? 



To please God, spend time with my femlly, find the man he has 
chosen for me, enjoy life on this earth to the fullest, and rejoice In 
heaven. 

Coffee, burgers, church- I'm up for anything!! 



I am learning on strengthening my faith right now, living In God's 
will, and preparing for his coming. 

I am serious alot of times but also have a very fiinny side when with 
good company 

nL^^^}S^ 'L'" MarlceOng. I also work with databases, i 
nnd this field both challenging and rewarding. I am always striving to 
do better and move up the ladder. 

Boot on the east coast, raised In Texas and Arizona, Ih^ed In 
Germany, and California. Love to settle somewhere where there are 
stars In the sky, big open fields and dean air. Been traveling since 1 
was a baby and recently went to Japan, 

not a big sports fan but like Just about everything else 
University of Redlands- Business and Management 



L^o f ^^^'^ ^^^^ * ^^'"•^ I know when 1 find 

t. But definitely must be Caucasian, tall, good-looklng gentelman who 
Is neat, clean-cut, sincere, likes to b-avel, has a sense of humor, and 
a follower of Christ. They must also love 

My mom died about 2 years ago and I rushed Into a relationship too 
27/31 
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Vitals : LubaO 
Create date 
First name 
Address 
Cfty 
Country 
Blith month 
Birth year 
Heard from text 



2001-10-30 Create time 04:52:20 
Last name 
Apt 

Biysk Province 
Russia Zip code 
3 Birthday 

Heard from 50 



Outside US, Canada and AustaraNa 
659302 
23 



Most similar users 

search: 

Stullone 

wlldemesggal 

riaughteroftfteKIng 

Laodk 

dare 

SmniBfic 



Multiple Choice 



Gender 


Female 


Age 


36-40 


Height 


5ft4 • 5ft6 


Marital status 


Divorced 


Weight 


125 - 140 lbs 


Education 


College - UnNerslty 


Eye vision 


Great Glasses 


Occupation 


Computers 


Eye color 


Blue 


Salary 


Not listed 


Hair color 


Blonde 


Hair length 


Medium curly/wavy 


Race 


White-Caucasian 


Body type (ladles) 


Average 


location 


Russia-OS 


Body type (gentiemen) 


Thin - Lean 


Children 


■ 1 


. Smoke 


No 


Children at home 


Yes 


Drink 


Sodally 


Want children 


tindedded . 


Pets 


None currently 


Christian faith 


I believe In God 


. Personality type 1 


Emotional 


Denomination 


Just Christian 


Quirks habits 


Nothing serious 


Church Involvement 


Attend occasionally 


Fashion sense 


Elegant 


Mood 


Usually cheerful 


Personality type 2 


Generally organized 


Sense of humor 


Enjoy a good joke or story 


Favorite music 


Classic Rock 


.Favorite outdoor activity 


Traveling 


T^nsportation 


Economy car 


Travel 


Would like to see more 


Describe looks 


Above average 


Describe Intellect 


Smart 


Reading material 


Adventure 


Relationship kind 


Marriage minded 
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BEST 



AVAILABLE COPY 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 



Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SIDES 



4^3 FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not rejport these problems to 
the IFW Image Problem Mailbox. 



BEST AVAILABLE IMAGES 




